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Abstract 

We consider Pareto optimal matchings (POMs) in a many-to-many market of ap¬ 
plicants and courses where applicants have preferences, which may include ties, over 
individual courses and lexicographic preferences over sets of courses. Since this is the 
most general setting examined so far in the literature, our work unifies and generalizes 
several known results. Specifically, we characterize POMs and introduce the Gener¬ 
alized Serial Dictatorship Mechanism with Ties (GSDT) that effectively handles ties 
via properties of network flows. We show that GSDT can generate all POMs using 
different priority orderings over the applicants, but it satisfies truthfulness only for 
certain such orderings. This shortcoming is not specific to our mechanism; we show 
that any mechanism generating all POMs in our setting is prone to strategic manip¬ 
ulation. This is in contrast to the one-to-one case (with or without ties), for which 
truthful mechanisms generating all POMs do exist. 

Keywords: Pareto optimality, many-to-many matching, serial dictatorship, truthfulness 

1 Introduction 

We study a many-to-many matching market that involves two finite disjoint sets, a set 
of applicants and a set of courses. Each applicant finds a subset of courses acceptable 
and has a preference ordering, not necessarily strict, over these courses. Courses do not 
have preferences. Moreover, each applicant has a quota on the number of courses she can 
attend, while each course has a quota on the number of applicants it can admit. 
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463 (Oceiakova) and OTKA grant K108383 (Fleiner). The authors gratefully acknowledge the support of 
COST Action IC1205 Computational Social Choice. A preliminary version of this paper has been published 
in the proceedings of the 8th International Symposium on Algorithmic Game Theory (SAGT 2015) [7]. 
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A matching is a set of applicant-course pairs such that each applicant is paired only 
with acceptable courses and the quotas associated with the applicants and the courses 
are respected. The problem of finding an “optimal” matching given the above market 
is called the Course Allocation problem (CA). Although various optimality criteria exist, 
Pareto optimality (or Pareto efficiency) remains the most popular one (see, e.g., [1, 2, 9, 
8, 19]). Pareto optimality is a fundamental concept that economists regard as a minimal 
requirement for a “reasonable” outcome of a mechanism. A matching is a Pareto optimal 
matching (POM) if there is no other matching in which no applicant is worse off and 
at least one applicant is better off. Our work examines Pareto optimal many-to-many 
matchings in the setting where applicants’ preferences may include ties. 

In the special case where each applicant and course has quota equal to one, our setting 
reduces to the extensively studied House Allocation problem (HA) [15, 1, 21], also known 
as the Assignment problem [12, 4]. Computational aspects of HA have been examined 
thoroughly [2, 18] and particularly for the case where applicants’ preferences are strict. 
In [2] the authors provide a characterization of POMs in the case of strict preferences 
and utilize it in order to construct polynomial-time algorithms for checking whether a 
given matching is a POM and for finding a POM of maximum size. They also show 
that any POM in an instance of HA with strict preferences can be obtained through the 
well-known Serial Dictatorship Mechanism (SDM) [1]. SDM is a straightforward greedy 
algorithm that allocates houses sequentially according to some exogenous priority ordering 
of the applicants. 

Recently, the above results have been extended in two different directions. The first one 
[17] considers HA in settings where preferences may include ties. Prior to [17], few works in 
the literature had considered extensions of SDM to such settings. The difficulty regarding 
ties, observed already in [20], is that the assignments made in the individual steps of the 
SDM are not unique, and an unsuitable choice may result in an assignment that violates 
Pareto optimality. In [5] and [20] an implicit extension of SDM is provided (in the former 
case for dichotomous preferences, where an applicant’s preference list comprises a single 
tie containing all acceptable houses), but without an explicit description of an algorithmic 
procedure. Krysta et al. [17] describe a mechanism called the Serial Dictatorship Mecha¬ 
nism with Ties (SDMT) that combines SDM with the use of augmenting paths to ensure 
Pareto optimality. SDMT includes an augmentation step, in which applicants already 
assigned a house may exchange it for another, equally preferred one, to enable another 
applicant to take a house that is most preferred given the assignments made so far. They 
also show that any POM in an instance of HA with ties can be obtained by an execution 
of SDMT and also provide the so-called Random Serial Dictatorship Mechanism with Ties 
(RSDMT) whose (expected) approximation ratio is with respect to the maximum-size 
POM. 

The second direction [8] extends the results of [2] to the many-to-many setting (i.e., 
CA) with strict preferences, while also allowing for a structure of applicant-wise acceptable 
sets that is more general than the one implied by quotas. Namely, [8] assumes that each 
applicant selects from a family of course subsets that is downward closed with respect to 
inclusion. This work provides a characterization of POMs assuming that the preferences of 
applicants over sets of courses are obtained from their (strict) preferences over individual 
courses in a lexicographic manner. Using this characterization, it is shown that deciding 
whether a given matching is a POM can be accomplished in polynomial time. In addition, 
[8] generalizes SDM to provide the Generalized Serial Dictatorship Mechanism (GSD), 
which can be used to obtain any POM for CA under strict preferences. The main idea 
of GSD is to allow each applicant to choose not her most preferred set of courses at once 
but, instead, only one course at a time (i.e., the most preferred among non-full courses 
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that can be added to the courses already chosen). This result is important as the version 
of SDM where an applicant chooses immediately her most preferred set of courses cannot 
obtain all POMs. 

Our contribution. In the current work, we combine the directions appearing in [17] 
and [8] to explore the many-to-many setting in which applicants have preferences, which 
may include ties, over individual courses. We extend these preferences to sets of courses 
lexicographically, since lexicographic set preferences naturally describe human behavior 
[13], they have already been considered in models of exchange of indivisible goods [8, 11] 
and also possess theoretically interesting properties including responsiveness [16]. 

We provide a characterization of POMs in this setting, leading to a polynomial-time 
algorithm for testing whether a given matching is Pareto optimal. We introduce the Gen¬ 
eralized Serial Dictatorship Mechanism with Ties (GSDT) that generalizes both SDMT 
and GSD. SDM assumes a priority ordering over the applicants, according to which ap¬ 
plicants are served one by one by the mechanism. Since in our setting applicants can be 
assigned more than one course, each applicant can return to the ordering several times 
(up to her quota), each time choosing just one course. The idea of using augmenting 
paths [17] has to be employed carefully to ensure that during course shuffling no applicant 
replaces a previously assigned course for a less preferred one. To achieve this, we utilize 
methods and properties of network flows. Although we prove that GSDT can generate 
all POMs using different priority orderings over applicants, we also observe that some of 
the priority orderings guarantee truthfulness whereas some others do not. That is, there 
may exist priority orderings for which some applicant benefits from misrepresenting her 
preferences. This is in contrast to SDM and SDMT in the one-to-one case in the sense 
that all executions of these mechanisms induce truthfulness. This shortcoming however 
is not specific to our mechanism, since we establish that any mechanism generating all 
POMs is prone to strategic manipulation by one or more applicants. 

Organization of the paper. In Section 2 we define our notation and terminology. 
The characterization is provided in Section 3, while GSDT is presented in Section 4. A 
discussion on applicants’ incentives in GSDT is provided in Section 5. Avenues for future 
research are discussed in Section 6. 


applicant 

quota 

preference list 

course 

quota 

ai 

2 

({C1,C2},{C3},0) 

Cl 

2 

02 

3 

({C2},{C1,C3},0) 

C2 

1 

03 

2 

({C3},{02},{01}) 

C3 

1 


Table 1; An instance I of CA. 


2 Preliminary definitions of notation and terminology 

Let A = {oi, 02, • • • , Oni} be the set of applicants, C = {ci, C2, • • • , the set of courses 
and [i] denote the set {l,2,...,i}- Each applicant a has a quota b{a) that denotes the 
maximum number of courses a can accommodate into her schedule, and likewise each 
course c has a quota q{c) that denotes the maximum number of applicants it can admit. 
Each applicant finds a subset of courses acceptable and has a transitive and complete 
preference ordering, not necessarily strict, over these courses. We write c c' to denote 
that applicant a (strictly) prefers course c to course c', and c ~a c' to denote that a is 
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indifferent between c and c'. We write c c' to denote that a either prefers c to c' or is 
indifferent between them, and say that a weakly prefers c to cb 

Because of indifference, each applicant divides her acceptable courses into indifference 
classes such that she is indifferent between the courses in the same class and has a strict 
preference over courses in different classes. Let Cf denote the f’th indifference class, or 
fie, of applicant a where t G [n 2 ]. The preference list of any applicant a is the tuple of sets 
Cf, i.e., P{a) = {Cf, Cf, • • • , we assume that Cf = 0 implies Cf', = 0 for all t' > t. 
Occasionally we consider P{a) to be a set itself and write c G P{a) instead of c G Cf 
for some t. We denote by V the joint preference profile of all applicants, and by V{—a) 
the joint profile of all applicants except a. Under these definitions, an instance of CA is 
denoted by / = {A,C,P,b,q). Such an instance appears in Table 1. 

A (many-to-many) assignment ^ is a subset of A x C. For a G A, = {c G C : 
{a,c) G n} and for c G C, ii{c) = {a G A : (a, c) G pf. An assignment is a matching 
if //(a) C P{a) —and thus p. is individually rational, |/i(a)| < 6(a) for each a G A and 
|/u(c)| < q{c) for each c G C. We say that a is exposed if |/r(a)| < 6(a), and is full 
otherwise. Analogous definitions of exposed and full hold for courses. 

For an applicant a and a set of courses 5, we define the generalized characteristic vector 
Xa{S) as the vector (|5 n Cf|, |5 n Cfl,..., |5 n We assume that for any two sets 

of courses S and U, a prefers 5 to C/ if and only if Xa{S) >iex Xa{U), i.e., if and only if 
there is an indifference class Cf such that jS* n Cf | > \U GiCf] and |5 n (7(51 = \U H (7(5| 
for all t' < t. If a neither prefers S to U nor U to S, then she is indifferent between S and 
U. We write S)~aU if a prefers S to U, S cza U it a is indifferent between S and U, and 
S PaU it a weakly prefers S to U. 

A matching /x is a Pareto optimal matching (POM) if there is no other matching in 
which some applicant is better off and none is worse off. Formally, is Pareto optimal if 
there is no matching jj,' such that p![a] Pa p-{o) for all a G A, and p!{a'))~a'h{o!) for some 
a' G A. If such a p' exists, we say that p' Pareto dominates p. 

A deterministic mechanism f maps an instance to a matching, i.e. f : I p where I is 
a CA instance and pis a matching in L A randomized mechanism f maps an instance to a 
distribution over possible matchings. Applicants’ preferences are private knowledge and an 
applicant may prefer not to reveal her preferences truthfully. A deterministic mechanism 
is dominant strategy truthful (or just truthful) if all applicants always find it best to declare 
their true preferences, no matter what other applicants declare. Formally speaking, for 
every applicant a and every possible declared preference list P'{a), (p{P{a),P{—a)) Pa 
4>{P'{a),P{—a)), for all P{a),V{—a). A randomized mechanism f is universally truthful 
if it is a probability distribution over deterministic truthful mechanisms. 

3 Characterizing Pareto optimal matchings 

Manlove [18, Sec. 6.2.2.1] provided a characterization of Pareto optimal matchings in 
HA with preferences that may include indifference. He defined three different types of 
coalitions with respect to a given matching such that the existence of either means that 
a subset of applicants can trade among themselves (possibly using some exposed course) 
ensuring that, at the end, no one is worse off and at least one applicant is better off. 
He also showed that if no such coalition exists, then the matching is guaranteed to be 
Pareto optimal. We show that this characterization extends to the many-to-many setting, 
although the proof is more complex and involved than in the one-to-one setting. We then 
utilize the characterization in designing a polynomial-time algorithm for testing whether 
a given matching is Pareto optimal. 

In what follows we assume that in each sequence C no applicant or course appears 
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more than once. 

An alternating path coalition w.r.t. /a comprises a sequence £ = 

where r > 1, Cj^ G //(a^) (0 < /c < r - 1), Cj^ 0 ^( 04 - 1 ) 
(1 < A: < r), Oig is full, and Cj^ is an exposed course. Furthermore, ai^ prefers Cj^ to 
Cjg and, if r > 2, weakly prefers to Cj^ {1 <k <r — 1). 

An augmenting path coalition w.r.t. p, comprises a sequence £ = (ajg,Cj^,aq, 

• • • where r > 1, Cj^ G n{aij {1 <k <r -1), cj^ 0 /i(a4-i) (1 < A; < r), 

ajp is an exposed applicant, and Cj^ is an exposed course. Furthermore, ajg finds 
acceptable and, if r > 2, 0 *^, weakly prefers to Cj,, (1 < A < r — 1). 

A cyclic coalition w.r.t. p. comprises a sequence € = (cjg, a^g, , aq,..., , 

air-i) where r > 2, Cj^^ G //(aq) (0 < A < r - 1), and Cj^ 0 ^(aifc-i) (1 < A < r). 

Furthermore, aq prefers Cj-^ to Cjg and aq weakly prefers to cq (1 < A < r — 1). (All 
subscripts are taken modulo r when reasoning about cyclic coalitions). 

We define an improving coalition to be an alternating path coalition, an augmenting 
path coalition or a cyclic coalition. 

Given an improving coalition (£, the matching 

= (/^\ {(a 4 >cq) ; -5 < A < r - 1}) U {(aq,cq^J : 0 < A < r - 1}} (1) 

is defined to be the matching obtained from // by satisfying £ (5 = 1 in the case that C is 

an augmenting path coalition, otherwise (5 = 0). 

We will soon show that improving coalitions are at the heart of characterizing Pareto 
optimal matchings (Theorem 2). The next lemma will come in handy in the proof of 
Theorem 2 and Theorem 10. We say that a sequence of applicants and courses is a 
pseudocoalition if it satisfies all conditions of an improving coalition except that some 
courses or applicants may appear more that once. 

Lemma 1. Let a matching ja in an instance I of CA admit a pseudocoalition K of length 
£ for some finite i. Then p. admits an improving coalition £ of length at most 1. 

Proof. We prove this by induction. Obviously £ > 2. For the base case, it is easy to see 
that K itself is an augmenting path coalition (where r = 1) if £ = 2 , an alternating path 
coalition (where r = 1) if £ = 3, a cyclic coalition (where r = 2) if £ = 4 and K ends 
with an applicant, and an augmenting path coalition (where r = 2) if .A = 4 and K ends 
with a course. Assume that the claim holds for all pseudocoalitions of length d, d < £. 
We show that it also holds for any given pseudocoalition K of length £. In the rest of the 
proof we show that either K is in fact an improving coalition, or we can derive a shorter 
pseudocoalition K' from K, hence completing the proof. 

The case for a repeated course; Assume that a course c appears more than once 
in K. We consider two different scenarios. 

1. If c is not the very first element of K, then sequence K is in the following form where 
where c = cq = cq. 

• • • ) 1 1 ) Gy > • • • 

In this case, we simply delete the portion of K under the arrow. Note that in the 
new sequence cq appears right after after aq_^, but then cq = cq. Hence we have 
obtained a shorter sequence, and it is easy to verify that this new sequence is a 
pseudocoalition. 

2. If c is the first element of AT, then K is in the following form where c = cq = cq. 

^jo) Q-io) • • • ) ®q-l ) Gy ’ ■ • • 
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In this case, we simply only keep the portion of K under the arrow, i.e. our new 
sequence starts with and ends with The new sequence is shorter and it is 

easy to verify that it is a pseudocoalition. 

The case for a repeated applicant: Assume that an applicant a appears more than 
once in K. We consider the three different possible scenarios. 

1. If a is not the very first element of K, nor the last one, then K is in the following 

form where a = . 

■ ■ ■ 1 ^jx ) ^ix ) ^jx + l ) • • ■ ) ' 

Note that Cj^ and cj^ (since = aj^). We consider two 

different cases. 

(a) Cj^+i^ai^Cjy, then we simply only keep the portion of K under the arrow and 

add Cjy to the beginning. That is, the new sequence is K' = , 

• K' is shorter than K and it is easy to verify that that it is a 

pseudocoalition. 

(b) If Cjy ) then Cj^. Then we simply remove the portion of K 

from up to That is, the new sequence is K' = (... Cj^, • • •) 

which is shorter than K. Note that if either Cj^+ior then 

Cjyj^^)^ai^Cjx- Hence it is easy to verify that K' is a pseudocoalition. 

2. If a is the first element of K but not the last one, then K is in the following form 
where a = aig = ai . 

jr. -^-> 

, . . . , Cjy , aiy , , • • • 

Notice that uq is exposed and finds acceptable and is not matched to it 

under //. Hence we simply remove the portion of K under the arrow and get 
K' = (ojg, ...). K' is shorter than K and it is easy to verify that it is a 
pseudocoalition. 

3. If a is the last element of K but not the first one, then K is of the following form 

where a = . 

CjQ ,..., Cj ^, ,..., o,iy-i ) Cjy , aiy 

Note that, as = ai^, Cj^ and cjg cj^. Furthermore, is not 

matched to cj^ in fj,. We consider two different cases. 

(a) If Cjy , then we do as we did in Case 1(a). That is, we only keep the 

portion of K under the arrow and add to the beginning; the new sequence 
is K' = {cjy, Qi^, ..., aiy_^). K' is shorter than K and it is easy to verify 
that it is a pseudocoalition. 

(b) If Cjy Cj^+i, then Cj^+i Cj^- Then we simply remove the 

portion of K from until the end of the sequence. That is, the new sequence 
is K' = (cjQ ,,Cj^,ai^). K' is shorter than K and it is easy to verify that it 
is a pseudocoalition. 

Note that we do not need to check the case where K starts and ends with an applicant, 
since then K would not fit the definition of an improving coalition. □ 

The following theorem gives a necessary and sufficient condition for a matching to be 
Pareto optimal. 
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Theorem 2. Given a CA instance I, a matching fi is a Pareto optimal matching in I if 
and only if fi admits no improving coalition. 

Proof. Let /x be a Pareto optimal matching in I. Assume to the contrary that p, admits an 
improving coalition It is fairly easy to see that matching obtained from p according 
to equation (1) Pareto dominates p., a contradiction. 

Conversely, let ^ be a matching in I that admits no improving coalition, and suppose 
to the contrary that it is not Pareto optimal. Therefore, there exists some matching 
p' p such that p' Pareto dominates p. Let = p® p' he the graph representing the 
symmetric difference of p and p'. is hence a bipartite graph with applicants in one 

side and courses in the other; by abusing notation, we may use a or c to refer to a node in 
corresponding to applicant a £ A or course c G C, respectively. Note that each edge 
of Gfj_^fj,i either belongs to p, referred to as a p-edge, or to p', referred to as a p'-edge. 

We first provide an intuitive sketch of the rest of the proof. Our goal is to find an 
improving coalition, hence contradicting the assumption that p does not admit one. We 
start with an applicant ajg who strictly prefers p' to p. We choose a ^'-edge {oi ^, Cjj) such 
that belongs to the first indifference class where p' is better than p for ajp. Either 
this edge already represents an improving coalition—which is the case if is exposed in 
p —or there exists a p-edge incident to . In this fashion, we continue by adding edges 
that alternatively belong to p' (when we are at an applicant node) and p (when we are 
at a course node), never passing the same edge twice (by simple book-keeping). Most 
importantly, we always make sure that when we are at an applicant node, we choose a 
p'-edge such that the applicant weakly prefers the course incident to the ^'-edge to the 
course incident to the precedent p-edge. We argue that ultimately we either reach an 
exposed course (which implies that an augmenting or an alternating path coalition exists 
in p) or are able to identify a cyclic coalition in p. 

Let us first provide some definitions and facts. Let A' be the set of applicants who prefer 
p' to p; i.e.. A' = {a\p'{a))~aP{o)}- Moreover, for each a £ A, let P\^{a) = p'{a) \ p{a) 
and p\i^i{a) = p{a) \ p'{a). Likewise, for each course c £ C, let p\n'{c) = p{c) \ p'{c) 
and P\^{c) = p'{c) \ p{c). Note that these sets will be altered during the course of the 
proof. In what follows and in order to simplify presentation, we will say that we remove 
a p'-edge (a, c) from G^^^/ to signify that we remove c from P\^{a) and remove a from 
/r(^^(c); similarly, we say that we remove a p-edge (a,c) from to signify that we 

remove c from p\^i{a) and remove a from /r\^/(c). The facts described below follow from 
the definitions and the assumption that p' Pareto dominates p. Let T be a set containing 
applicants (initially, T = 0); we will explain later what this set will come to contain. 

Fact 1: For every course c that is full under p, it is the case that |/r\^'(c)| > \p'^^^{c)\. 

Fact 2: For every applicant a G A' \ T, there exists £* < n 2 such that |G“ n p'^^^{a)\ = 
|G“ n p\^i{a)\ for all £ < and |G“* n p'\^^{a)\ > |G“* n p\^i{a)\. Hence, G“* denotes the 
first indifference class of a in which p' is better than p for a. 

Fact 3: For every applicant a £ A' that is full in p, there exists , £* < < n 2 , such 

that |G“+ n/i\^/(a)| > |G“+ n^'^^(a)|. 

In what follows, we iteratively remove edges from G^^^/ (so as not to visit an edge 
twice and maintain the aformentioned facts) and create a corresponding path Gl, which we 
show that in all possible cases implies an improving coalition in p, thus contradicting our 
assumption. 

As p' Pareto dominates p, A' is nonempty. Let be an applicant in A'. By Fact 2, 
there exists a course G G^* n p', Hence, we remove edge (aiQ,Cj^) from G^^^/ 
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and add it to i.e., £ = (ajp, Cj^). If Cj^ is exposed, then an augmenting path coalition is 
implied. More specifically, if Oig is also exposed in then £ = (ajp,Cjj) is an augmenting 
path coalition. Otherwise, if ajg is full in then it follows from Fact 3 that there exists a 
course Cjg G /i\^/(aig) such that Cj^)^a^^CjQ. Then = (cjg, Ojg, Cji) is an alternating path 
coalition. 

If, on the other hand, Cj^ is full, we continue our search for an improving coalition in 
an iterative manner (as follows) until we either reach an exposed course (in which case 
we show that an augmenting or alternating coalition is found) or revisit an applicant 
that belongs to A' (in which case we show that a cyclic coalition is found). Note that 

^ hence it follows from Fact 1 that >1. At the start of each 

iteration A: > 1, we have: 

^ — (o-ig, Cjj, . . . , , Cj^,) 

such that by the construction of d (through the iterative procedure) every applicant 
on £ weakly prefers the course that follows her on C— i.e., to which she is matched 
to in fi' but not in fi —to the course that precedes her—i.e., Cj^ to which she is matched to 
in fi but not in fi’. Moreover, Ujg is either exposed and can accommodate one more course, 
namely Cj^, or is full and hence there exists a course Cj^ G such that Ojg strictly 

prefers Cjj to cjg. Notice that C would imply an improving coalition (using Lemma 1) if 
Cj^ is exposed. 

During the iterative procedure, set T includes those applicants who strictly prefer the 
course they are given in € under fi' than under fi. That is, T = {a^ G € : 

Cj^} U {uig}. Note that applicant a^g is also included in T, since either she is exposed, or 
3cjg G fi\f,'{aio) such that Cjj^aigCjg. 

The Iterative Procedure: We repeat the following procedure until either the last course 
on £, Cjf ,, is exposed or Case 3 is reached. If Cj,, is full, then by Fact 1 there must exist an 
applicant ai^ G fi\^,{ci^); note that ai^ / a 4 _^. Remove from Let C/'‘ be 

the indifference class of ajj, to which cj^, belongs. We consider three different cases. 

• Case 1: n ^(^^(aij,)| > 1(7“** n /U\^/(a 4 )|. Then there must exist a course 

^ ^ that Hence, we remove from 

and append ( 04 , 0 ^^^^^) to C. 

• Case 2: |G^** H fi'\^^{aii,)\ < |G“** n /^\/i'(® 4 )l ai^, ^ T. Recall that fi' Pareto 
dominates fi, hence it must be that 04 G A'. Therefore if follows from Fact 2 that 

< £ and there exists a course G G^** n ^(^^( 04 ); note that C 4 / 

Hence, we remove (aij,,C 4 _^J from G^^^/ and append to £. Moreover, we 

add 04 to T. 

• Case 3: |G^** n //(^^(a 4 )| < |G“** n /i\^'(a 4 )| and 04 G T; note that this case 

cannot happen when k = 1. Let 04 be the first occurrence of 04 which has resulted 
in 04 being added to T; i.e., 0 < z < k and 04 = Therefore, d is of the form 
( 0 . 4 , C 4 , 0 : 4 ,, 0 : 4 , ? * * • ^ik—i ? ^jk ) * bet US consider i^jk ’ ^ * * * 1 ) * 

Since 04 = 04 G T, 04 G A' and, by the construction of £, G G^J , it follows 
from Fact 2 that £*. < £, which implies (using the assumption of Case 3) that 

If at any iteration k we find a C 4 that is exposed in fi or we arrive at Case 3, then an 
improving coalition is implied (using Lemma 1), which contradicts our assumption on fi. 
Otherwise (cases 1 and 2), we continue with a new iteration. However, since the number 



of edges is bounded, this iterative procedure is bound to terminate, either by reaching an 
exposed course or by reaching Case 3. □ 

It follows from Theorem 2 that in order to test whether a given matching fx is Pareto 
optimal, it is sufficient to check whether it does not admit any improving coalition. To 
check this, we construct the extended envy graph associated with jj,. 

Definition 1. The extended envy graph G{fx) = associated with a matching 

fx is a weighted digraph with Vg[^) = VaUVcUP^, where Vc = {c : c G C}, Va = {a : a G A} 
and Vf, = {ac : {a, c) G ii], and Eg(^) = U U where: 

• ~ ■ a G A, c G C, and |^(c)| < Q'(c)} |J{(c, ac') : c G C and \fx{c)\ < 

g(c)}, with weight equal to 0; 

• ~ c) : a G ^4, c G P{a)\fx{a) and \fx{a)\ < b{a)} ci'c) ■ a G A\{a'}, c G 

P{a) \ fx{a) and \fx{a)\ < b{a)}, with weight equal to —1; and 

• ~ {(®c, c') : c' G P{a) \ fx{a) and c' Pa c} |J{(®c, a'c') : c' G P{a) \ fx{a) and 
c' Pa c}, with weight equal to 0 if c' c^a c or —1 if c' Pa c. 

That is, G{fx) includes one node per course, one node per applicant and one node per 
applicant-course pair that participates in fj,. With respect to Eqi^^-j, a ‘0’ (‘—1’) arc is 
an arc with weight 0 (—1). There is a ‘0’ arc leaving from each exposed course node to 
any other (non-course) node in G(/i) (arcs in Moreover, there is a —1 arc leaving 

each exposed applicant node a towards any course node or applicant-course node whose 
corresponding course is in P{a) \ ix{a) (arcs in Finally, there is an arc leaving 

each applicant-course node ac towards any course node or applicant-course node whose 
corresponding course c' is not in fx{a) and c' Pa c. The weight of such an arc is 0 if c' ~a c 
or —1 if c' Pa c (arcs in 

The following theorem establishes the connection between Pareto optimality of a given 
matching and the absence of negative cost cycles in the extended envy graph corresponding 
to that matching. 

Theorem 3. A matching qi is Pareto optimal if and only if its extended envy graph G{iJ,) 
has no negative cost cycles. 

Proof. First we show that if G{jx) has no negative cost cycle, then fx is Pareto opti¬ 
mal. Assume for a contradiction that fx is not Pareto optimal; this means that there 
exists an improving coalition £. We examine each of the three different types of im¬ 
proving coalition separately. Assume that £ is an alternating path coalition, i.e., C = 

where r > 1, (a 4 ,CjJ G p. {0 < k < r-1), (a 4 _^,cjJ 0 
p {I < k < r), aiQ is full, and cj^ is an exposed course. Furthermore, Pai^ Cjo and, 
if r > 2, —<^ik {1 < k < r — 1). By Definition 1, there exists in G{p) an arc 

{ai^Cj^,ap^.^Cj^^.^) for each 0 < A: < r - 2, an arc {ap_iCp_i,cp), and an arc (cp, aif^Cjg), 
thus creating a cycle K in G{p). Since all arcs of K have weight 0 or —1 and (oipCjp, 
has weight —1 (because Cjj Pai^ Cjo), K is a negative cost cycle, a contradiction. A similar 
argument can be used for the case where £ is an augmenting path coalition; our reasoning 
on the implied cycle would start with a ‘—1’ arc (ai(,,aj^CjJ and would conclude with a 
‘0’ arc (cj^,ajg). Finally, for a cyclic coalition €, the implied negative cost cycle includes 
all arcs ( 0 *^, 0 ^^,, for 0 < /c < r — 1 taken modulo r, where at least one arc has 

weight —1 (otherwise, (t would not be a cyclic coalition). In all cases, the existence of an 
improving coalition implies a negative cost cycle in G{p), a contradiction. 
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Next, we show that if /j, is Pareto optimal, G{fi) has no negative cost cycles. We 
prove this via contradiction. Let ^ be a POM, and assume that G{p) has a negative cost 
cycle. Given that there exists a negative cost cycle in G{fj,), there must also exist one that 
is simple and of minimal length. Hence, without loss of generality, let iL be a minimal 
(length) simple negative cost cycle in G{n). We commence by considering that K contains 
only nodes in V)j. We create a sequence 1C corresponding to K as follows. Starting from 
an applicant-course node of K with an outgoing ’—1’ arc, say ai^Cjg, we first place cjg in 
JC followed by a^g, and continue by repeating this process for all nodes in K. It is then 
easy to verify, by Dehnition 1 and the definition of a cyclic coalition, that K corresponds 
to a sequence K, = {cjg , Oip,..., Cj ^_^, ) that satisfies all conditions of a cyclic coalition 

with the exception that some courses or applicants may appear more than once. But this 
means that, by Lemma 1, /x admits an improving coalition, which contradicts that /x is a 
POM. 

To conclude the proof, we consider the case where K includes at least one node from 
Va or Vc- Note that if K contains a node from Va, then it also contains a node from Vc- 
This is because, by Definition 1, the only incoming arcs to an applicant node are of type 
Therefore K includes at least one course node. We now claim that since K is of 
minimal length, it will contain only one course node. Assume for a contradiction that K 
contains more than one course nodes. Since K is a negative cost cycle, it contains at least 
one ‘—1’ arc which leaves an applicant node a (or an applicant-course node ac'). Let c be 
the hrst course node in K that appears after this ‘—1’ arc. Recall that for c to be in K, c is 
exposed in /x and there exists a ‘0’ arc from c to any applicant or applicant-course node of 
G{fi). Hence, there also exists such an arc from c to a (or to ac'), thus forming a negative 
cost cycle smaller than K, a contradiction. Note that this also implies that K contains at 
most one node in Va- This is because, by Definition 1, the only incoming arcs to nodes 
in Va are from nodes in Vc, therefore the claim follows from the facts that c is the only 
course node in K, and AT is a simple cycle. We create a sequence K, corresponding to K as 
follows. If K contains an applicant node a, we start 1C with a. If K contains a course node 
c, we end K, with c. Any applicant-course node a'c' is handled by placing c' first and then 
a'. It is now easy to verify, by Definition 1 and the definition of a cyclic coalition, that K 
corresponds to a sequence K, = {aig,c 


*0 5 '-Jl ) ) ■ • • ) '-'Xt— 1 ) “*r-l ) '-'X 


a,-. 


, c,' 


(if K contains an applicant 
node) or a sequence K, = {cjg,aig ,..., (if K contains no applicant node), 

such that 1C satisfies all conditions of either an augmenting path coalition (if K contains 
an applicant node) or an alternating path coalition (if K contains no applicant node), 
with the exception that some courses or applicants may appear more than once. But this 
means that, by Lemma 1, /x admits an augmenting path coalition (or an alternating path 
coalition), which contradicts that /x is a POM. □ 


Theorem 3 implies that to test whether a matching /x is Pareto optimal or not, it 
suffices to create the corresponding extended envy graph and check whether it contains 
a negative cost cycle. It is easy to see that we can create G{^) in polynomial time. To 
test whether G(/x) admits a negative cycle, we can make use of various algorithms that 
exist in the literature (see [10] for a survey), e.g. Moore-Bellman-Ford which terminates 
in 0(|VG(/i)ll-®G(/i)l) time. However, as all arcs in G{^) are of costs either 0 or —1 and 
hence integer, we can use a faster algorithm of [14] which, for our setting, terminates in 

0{^J\Vg(^,)\\Eg(^,)\) time. 

Corollary 4. Given a CA instance I and a matching /x, we can check in polynomial time 
whether /x is Pareto optimal in I. 
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4 Constructing Pareto optimal matchings 


We propose an algorithm for finding a POM in an instance of CA, which is in a certain 
sense a generalization of Serial Dictatorship thus named Generalized Serial Dictatorship 
Mechanism with ties (GSDT). The algorithm starts by setting the quotas of all appli¬ 
cants to 0 and those of courses at the original values given by q. At each stage i, the 
algorithm selects a single applicant whose original capacity has not been reached, and 
increases only her capacity by 1. The algorithm terminates after B = stages, 

i.e., once the original capacities of all applicants have been reached. In that respect, the 
algorithm assumes a ‘multisequence’ S = (a^, a^,..., o'®) of applicants such that each 
applicant a appears 6 (a) times in S; e.g., for the instance of Table 1 and the sequence 
S = (ai, ai, 02 , 02 , 03 , 02 , 03 ), the vector of capacities evolves as follows: 

( 0 , 0 , 0 ), ( 1 , 0 , 0 ), ( 2 , 0 , 0 ), ( 2 , 1 , 0 ), ( 2 , 2 , 0 ), ( 2 , 2 , 1 ), ( 2 , 3 , 1 ), ( 2 , 3 , 2 ). 

Let us denote the vector of applicants’ capacities in stage i by 6 *, i.e., 6 ° is the all¬ 
zeroes vector and 6 '® = 6 . Clearly, each stage corresponds to an instance P similar to 
the original instance except for the capacities vector 6 *. At each stage i, our algorithm 
obtains a matching qd for the instance P. Since the single matching of stage 0, i.e., the 
empty matching, is a POM in I®, the core idea is to modify in such way that if 
is a POM with respect to P~^ then /x* is a POM with respect to P. To achieve this, the 
algorithm relies on the following flow network. 

Consider the digraph D = {V,E). Its node set is C = AuTUCUIcr, r} where a and r 
are the source and the sink and vertices in T correspond to the ties in the preference lists 
of all applicants; i.e., T has a node (o, t) per applicant a and tie t such that Cf / 0. Its arc 
set is i? = illiU£' 2 U£' 3 ULI 4 where £1 = {(a, o) : o G A}, £2 = {(o, {a,t)) : o G A, Cf / 0}, 
-^3 = : c G Cf} and £4 = {(c,r) : c G C}. The graph D for the instance of 

Table 1 appears in Figure 1, where an oval encircles all the vertices of T that correspond 
to the same applicant, i.e., one vertice per tie. 

Using digraph D = {V, £), we obtain a flow network iV* at each stage i of the algorithm, 
i.e., a network corresponding to instance P, by appropriately varying the capacities of the 
arcs. (For an introduction on network flow algorithms see, e.g., [3].) The capacity of each 
arc in £3 is always I (since each course may be received at most once by each applicant) 
and the capacity of an arc e = (c, r) G £4 is always q{c). The capacities of all arcs in 
£1 U £2 are initially 0 and, at stage i, the capacities of only certain arcs associated with 
applicant o* are increased by 1. For this reason, for each applicant a we use the variable 
curr{a) that indicates her ‘active’ tie; initially, curr{a) is set to 1 for all o G A. 

In stage i, the algorithm computes a maximum flow /* whose saturated arcs in £3 
indicate the corresponding matching | 0 *. The algorithm starts with /^ = 0 and pP = ^. 
Let the applicant o* G A be a copy of applicant a considered in stage i. The algorithm 
increases by 1 the capacity of arc {a, a) G £1 (i.e., the applicant is allowed to receive an 
additional course). It then examines the tie curr{a) to check whether the additional course 
can be received from tie curr{a). To do this, the capacity of arc {a, {a, curr{a))) G £2 
is increased by 1. The network in stage i where tie curr{ad) is examined is denoted by 
j\ji,curr{w) ^ If there is an augmenting a — t path in this network, the algorithm augments 
the current flow to obtain /*, accordingly augments to obtain pd (i.e., it sets pd 
to the symmetric difference of pd~^ and all pairs (o, c) for which there is an arc ((o, t), c) in 
the augmenting path) and proceeds to the next stage. Otherwise, it decreases the capacity 
of (o, (o, corr(a))) by 1 (but not the capacity of arc ( 0 , 0 )) and it increases curr{a) by 1 
to examine the next tie of o; if all (non-empty) ties have been examined, the algorithm 
proceeds to the next stage without augmenting the flow. Note that an augmenting a — t 
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Figure 1: Digraph D for the instance I from Table 1. 

path in the network j\r*>cnrr(a®) corresponds to an augmenting path coalition in with 
respect to P. 

A formal description of GSDT is provided by Algorithm 1, where w{e) denotes the 
capacity of an arc e G E and © denotes symmetric difference. Observe that all arcs in E 2 
are saturated, except for the arc corresponding to the current applicant and tie, thus any 
augmenting path has one arc from each of Ei, E 2 and FI 4 and all other arcs from E^; as 
a consequence, the number of courses each applicant receives at stage i in any tie cannot 
decrease at any subsequent step. Also, /U* dominates with respect to instance P if 
and only if there is a flow in that saturates all arcs in E 2 . 


Algorithm 1: Producing a POM for any instance of CA 
Input: an instance I of CA and a multisequence S 
/o := 0 ; /iO := 0 ; 
for each a G A, curr{a) := 1; 
for i = 1,2,..., B do 
{ 

consider the applicant a = al; 
w{a, a)++; 

^3 1 = 0 ; 

while ^ = 0 and curr{a) < n 2 and / 0 do 

{ 

w{a, (a, curr{a)))++; 

‘P := augmenting path in with respect to 

if ip = 0 then { w{a, {a, curr{a)))--; curr{a)++}; 

} 

if ^ / 0 then {/*:=/* ^ © 

iP := © {(a, c) : ((a, t), c) G ^ for some tie t}; } 

otherwise { /* := ;= } 

} 

return /x^; 


To prove the correctness of GSDT, we need two intermediate lemmas. Let ct G 
be the vector having 1 at entry t and 0 elsewhere. 

Lemma 5. Let A*’* be the network at stage i while tie t of applicant a* is examined. Then, 
there is an augmenting path with respect to in A*’* if and only if there is a matching 
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fj, such that 

Xa{h{a)) = Xa(/U*“Ha)) for each a ^ and Xa^iKa")) = + et- 

Proof. Note that the flows and /*, corresponding to and fp respectively, are 
feasible in iV*. Moreover, is feasible in A^*’* and, if there is an augmenting path with 
respect to in then /* is feasible in Af*’* too. 

If there is a path augmenting by 1 in thus obtaining /*, the number of courses 
assigned per tie and applicant remain identical except for a* that receives an extra course 
from tie t. Thus, Xa{p{a)) is identical to Xa(/U*~^(a)) for all a G yl except for a* whose 
characteristic vector has its t’th entry increased by 1. 

Conversely, if the above equation holds, flow /* is larger than in the network A^*’*. 
Thus standard network flow theory implies the existence of a path augmenting As 

a side remark, since the network A^*’* has similar arc capacities with the except for 

the arcs {a, ah) and (a*, (a*,t)), this augmenting path includes these two arcs. 

□ 

Lemma 6. Let S '^a U and IS"! > |17|. If cs and cu denote a least preferred course of 
applicant a in S and U, respectively, then 5\{cs’} U\{cu}. 

Proof. For convenience, we denote 5\{cs'} by S' and U\{cij} by U'. Let 
Xa{S) = (S 1 ,S 2 , • • • ,5^2), Xa{U) = {ui, U 2 , . . . , Un^) 

and 

Xa{S') = (s'i,4 ,---,Sn2 )> Xa{U') = {u'l, u' 2 , . . . , . 

If S ~a u, then cs cu and Xa{S) = Xa{U), therefore XaiS') = XaiU') and S' ~a U'. 
Otherwise, S )^a U implies that there is A: G [712] such that Sj = Uj for each j < k and 

If there is j > k + 1 with Sj > 0, then cs G Cf with r > k + 1. It follows that 
4 = Sk > Uk > u'j. while s'- > 4 i < Xa{S') >iex Xa{U'), i.e.. S' U'. 

The same follows if + 2 because then s'j^ > Sk — 1 > Uk > u'^. 

It remains to examine the case where Sj = 0 for all j > A: + 1 and s^ = Uk + h. In 
this case, cs G Cf thus — 1 = Uk, while IS"! > \U\ implies that either Uj = 0 for 

all J > A + 1 or there is a single k' > k such that Uk' = 1. In the former case cjj G Cf 
for r < A thus Xa{S') >iex XafU'), whereas in the latter one = Uk = Sk — I = s'j^ and 
s'j =4 = 0 for all j > k hence Xa{S') = Xa{U'). □ 

Theorem 7. For each i, the matching fp obtained by GSDT is a POM for instance P. 

Proof. We apply induction on i. Clearly, = 0 is the single matching in I® and hence a 
POM in I^. We assume that fp~^ is a POM in P~^ and prove that ph is a POM in P. 

Assume to the contrary that /x* is not a POM in P. This implies that there is a 
matching ^ in P that dominates p'. Then, for all a £ A, (^{a) Pa p'ia) Pa Recall 

that the capacities of all applicants in P are as in P~^ except for the capacity of a* that 
has been increased by 1. Hence, for all a G A \ {oj}, |4®)l does not exceed the capacity 
of a in instance P~^, namely 6*“^(a), while exceed by at most 1. 

Moreover, it holds that |4 q*)I ^ I4(®*)l- Assuming to the contrary that |4®*)l < 
|;U*(a*)| yields that ^ is feasible also in instance P~^. In addition, < I4(®*)l implies 

that it cannot be ^{a') c^a^ p''{ah) thus, together with ^{ah) Pat p''{ah) Pai /x*“^(a*), it 
yields 4®*) Pp Rut then ^ dominates /x*“^ in P~^, a contradiction 

to p'‘~^ being a POM in P~^. 
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Let us first examine the case in which GSDT enters the ‘while’ loop and finds an 
augmenting path, hence fJ" dominates in P only with respect to applicant a* that 
receives an additional course. This is one of her worst courses in ^*(a*) denoted as c^. Let 
be a worst course for a* in Let also and //' denote ^\{(a*, c^)} and c^)}, 

respectively. Observe that both and are feasible in P~^, while having shown that 
|^(a*)| > implies through Lemma 6 that weakly dominates fx' which in turn 

weakly dominates by Lemma 5. Since is a POM in P~^, —a {o) —a 

/i*“^(a) for all a ^ A, therefore ^ dominates only with respect to a* and c^. 

Overall, ^(a) ~a M*(a) ~a iP~^{a) for all a G A \ {a*} and .^(a*) 

Let and be the ties of applicant a* containing and c^j,, respectively, where 
because Then, Lemma 5 implies that there is a path augmenting (i.e., 

the flow corresponding to in the network Let also P be the value of curr(a^) 

at the beginning of stage i. Since we examine the case where GSDT enters the ‘while’ 
loop and finds an augmenting path, ^ 0. Thus, t' indexes the least preferred tie from 
which a* has a course in The same holds for since /U*“^(a*). Because 

is obtained by removing from a* its worst course in ^(a*), that course must belong to a 
tie of index no smaller than t', i.e., t' < This together with yield t' < < t^, 

which implies that GSDT should have obtained ^ instead of fp at stage i, a contradiction. 

It remains to examine the cases where, at stage i, GSDT does not enter the ‘while’ 
loop or enters it but finds no augmenting path. For both these cases, /x* = thus 

^ dominating fp means that ^ is not feasible in P~^ (since it would then also dominate 
Tiien, it holds that |C(a*)| exceeds by 1 , thus |C(a*)| > |/x*(a*)| yielding 

^(a*) ;U*(a*). Let be defined as above and t' now be the most preferred tie from 

which a* has more courses in ^ than in fP. Glearly, t' < since indexes the least 
preferred tie from which P has a course in If t' < then the matching defined as 
above, is feasible in P~^ and dominates because {P) iP~^{P), a contradiction; 

the same holds if t' = and P has in ^ at least two more courses from than in 
Otherwise, t' = and P has in ^ exactly one more course from than in fp] that, 
together with |^(a*)| > |/i*(a*)| and the definition of implies that the index of the least 
preferred tie from which P has a course in and, therefore, the value of curr{P) in 
the beginning of stage i, is at most P. But then GSDT should have obtained ^ instead of 
fp at stage i, a contradiction. 

□ 


The following statement is now direct. 

Corollary 8. GSDT produces a POM for instance L 

To derive the complexity bound for GSDT, let us denote by L the length of the pref¬ 
erence profile in I, i.e., the total number of courses in the preference lists of all applicants. 
Notice that IT’s! = L and neither the size of any matching in I nor the total number of 
ties in all preference lists exceeds L. 

Within one stage, several searches in the network might be needed to find a tie of 
the active applicant for which the current flow can be augmented. However, one tie is 
unsuccessfully explored at most once, hence each search either augments the flow thus 
adding a pair to the current matching or moves to the next tie. So the total number of 
searches performed by the algorithm is bounded by the size of the obtained matching plus 
the number of ties in the preference profile, i.e., it is 0{L). A search requires a number of 
steps that remains linear in the number of arcs in the current network (i.e., 
but as at most one arc per Ei, E 2 and E 4 is used, any search needs 0(|£^3|) = 0{L) steps. 
This leads to a complexity bound 0{L^) for GSDT. 
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The next theorem will come in handy when implementing Algorithm 1, as it implies 
that for each applicant a only one node in T corresponding to a has to be maintained at 
a time. 

Theorem 9. Let be the network at stage i while tie t of applicant a* is examined. 
Then, there is no augmenting path with respect to in that has an arc of the form 
{{a^,£),c) where j < i and i < curr{a^). 

Proof. Assume otherwise. Let be such an augmenting path that is found in round i 
and used to obtain ph. Hence ip corresponds to an augmenting path coalition £ of the 
following form (a*, Cr,..., Cg, , Cq,...) where Cr is in the t’th indifference class of a* and 
both Cg and Cq are in the £’th indifference class of , £ < curr{a^). Note that as £ > 1 
thus curr{a^) > 1. It then follows from the description of Algorithm 1 that either is 
matched to at least one course in curr{a^) under or she is exposed (which would be the 
case when curr{a^) > n 2 or = 0)- 

We first show that a* and are not the same applicant. Otherwise, Gl' = {a^,Cq ,...)— 
obtained from G1 by discarding all courses and applicants that appear before a^ —is an 
augmenting path coalition w.r.t. in P. Clearly the matching obtained from by 
satisfying Pareto dominates as i < t, contradicting that p^ is a POM in P. 

In the remainder of the proof we assume that ad aP Let us first consider the case 
where is matched to a course in (7“'^ / under u*, and therefore by Lemma 5 to a 

course c G ^curr{ap Under Let (t' = {c,a^,Cq,...), i.e., £' is obtained from £ by 

discarding all courses and applicants that appear before a^ and replacing them with c. 
Since £ < curr{a^), we have Cqi^^jC. It is then easy to see that £' is an alternating path 
coalition w.r.t. p''~^ in P~^, and hence p''~^ is not a POM in P~^, a contradiction. We 
now consider the case where is exposed in p^, and hence exposed in p^~^ w.r.t. P~^. 
Let el' = {a^ ,Cq, ...), i.e., Gl' is obtained from G1 by discarding all courses and applicants 
that appear before aP It is clear that £' is an augmenting path coalition w.r.t. p^~^ in 
P~^, contradicting that p' is a POM in P~^. □ 

Next we show that GSDT can produce any POM. Our proof makes use of a subgraph 
of the extended envy graph of Definition 1. 

Theorem 10. Given a CA instance I and a POM p, there exists a suitable priority 
ordering over applicants S given which GSDT can produce p. 

Proof. Given an instance I and a POM p, let G = (V, E) be a digraph such that V = 
{ac : (a, c) G p} and there is an arc from ac to a'c' if a / o', c' ^ p{a) and c' c. An arc 
{ac,a'd) has weight —1 if a prefers c' to c and has weight 0 if she is indifferent between 
the two courses. Note that G is a subgraph of the extended envy graph G{p) introduced 
in Definition 1. We say that ac envies a'c' if (ac, a'c') has weight —1. 

Note that if there exists an applicant-course pair ac G p and a course c' such that c'^aC 
and ac' ^ p, then c' must be full under p, or else p admits an alternating path coalition 
and is not a POM. 

Moreover, all arcs in any given strongly connected component (SGG) of G have weight 
0. To see this, note that by the definition of a SCO, there is a path from any node to 
every other node in the component. Hence, if there is an arc in a SCO with weight —I, 
then there must be a cycle of negative weight in that SCO. It is then straightforward to 
see that, by Lemma 1, p admits a cyclic coalition, a contradiction. 

It then follows that if c'^aC, then ac' and ac cannot belong to the same SGG. Should 
that occur, there would be an arc (of weight 0) from ac' to some vertex a'c* in the same 
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see, implying that c* ~a c' and thus c*)~aC- This in turn would yield that there is an arc 
of weight —1 from ac to a'c* in this See, a contradiction. 

We create the condensation G' of the graph G. It follows from the definition of Sees 
that G' is a DAG. Hence G' admits a topological ordering. Let X' be a reversed topological 
ordering of G' and X be an ordering of the vertices in G that is consistent with X' (the 
vertices within one SGG may be ordered arbitrarily). Let S be an ordering over applicant 
copies that is consistent with X (we can think of it as obtained from X by removing the 
courses). We show that GSDT can produce jj, given S. Note that technically S must 
contain b{a) copies of each applicant a. However, as // is Pareto optimal, upon obtaining 
// the algorithm will not be able to allocate any more courses to any of the applicants, 
even if we append more applicant copies to the end of S. 

We continue the proof with an induction. Recall that X is an ordering over the matched 
pairs in We let W(a*) = c where a*c is the ith element of X. Let /x* denote the matching 
that corresponds to the first i elements of X; hence = fj,. We claim that given S, 
GSDT is able to produce at stage i, after augmenting through an appropriate 
augmenting path. 

For the base case, note that a^X(a^) does not envy any other vertex in G and hence it 
can only be that X{a^) G C^i. It is then easy to see that the path (u, a^, (o^, l),X(a^),r) 
is a valid augmenting path in and hence GSDT might choose it. 

Assume that GSDT produces /x^ at the end of each stage £ for al\ I < i. We prove that 
it can produce /x* at the end of stage i. Assume, for a contradiction, that this is not the 
case. Let r denote the indifference class of a* to which X(a*) belongs. Note that course 
X(a*) is not full in /*, so if the path (a, a*, {a^ , r) , X (a^) , t) is not chosen by GSDT, it must 
be the case that GSDT finds an augmenting path in X*’* for some t < r. Let C denote the 
corresponding augmenting path coalition with respect to the matching which would 

be of the form 

where G C*. and Cj^ is exposed in It follows, from the definition of an augmenting 
path, that there is an edge in G for all /c, 1 < k < y — 2. Furthermore, 

as Cj-^y, there is an edge of weight —1 in G from a®X(a*) to therefore 

belongs to a SGG of higher priority than the one to which a*X(a*) belongs. If Cj^ is exposed 
in /X, then C' that is obtained by adding X{a£) to the beginning of d is an alternating path 
coalition in y, a contradiction to y being a POM. Therefore Cj^ must be full in y. 

As Cjy is not full in and full in /x, there must exist an a^, z > i, such that 
{a^,Cjy) G y. It follows, from the augmenting path coalition C, that there is a path in 
G from to a^cj^. If there is also a path from to then the two vertices 

belong to the same SGG; as belongs to a SGG of higher priority than the one to 

which a*X(a*) belongs, so must a" Cjy, implying that a^cjy must have appeared before 
a*X(a*) in X, a contradiction to z > i. If there is no such a path, then a^Cjy belongs to a 
SGG that is prioritized even over the SGG to which belongs, and hence must have 

appeared before a*X(a*) in X, a contradiction to z > i. □ 

5 Truthfulness of mechanisms for finding POMs 

It is well-known that the SDM for HA is truthful, regardless of the given priority ordering 
over applicants. We will show shortly that GSDT is not necessarily truthful, but first 
prove that this property does hold for some priority orderings over applicants. 

Theorem 11. GSDT is truthful given S if, for each applicant a, all occurrences of a in 
S are consecutive. 
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Ii with /ii I2 with /i2 


01 
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^ © 
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: C2 


Cl 

02 
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02 

: Cl 


C2 


/a with ^2 Ia 


Figure 2 ; Four instances of CA used in the proof of Theorem 14 . In all four instances 
b{ai) = 2 , 6(02) = 1 , q{ci) = q{c 2 ) = 1 - For each of instances Ii to Is, a matching is 
indicated using circles in applicants’ preference lists. 


Proof. Without loss of generality, let the applicants appear in S in the following order: 


Oi, Oi, . . . , ai, 02, 02, 


, a 2 ..., a^-i, a^-i, 


6(ai)-times 


6(a2)-times 


6(ai_i)-times 


, 0-2—0-2 5 ^2? • • • 5 • 


6(ai)-times 


Assume to the contrary that some applicant benehts from misrepresenting her preferences. 
Let Oj be the hrst such applicant in S who reports P'{ai) instead of P(oi) in order to beneht 
and V' = (P'(aj),'P(—Oj)). Let // denote the matching returned by GSDT using ordering 
S on instance I = (A, C, "P, b, q) (i.e. the instance in which applicant o* reports truthfully) 
and ^ the matching returned by GSDT using S but on instance I’ = {A,C,V',b,q). Let 
s = (S£<j 6 (o£)) + 1 , i.e., s is the first stage in which our mechanism considers applicant 
Oj. Let j be the first stage of GSDT such that Oj prefers to , where s < j < s + 6 ( 0 *). 

Given that applicants ai,, Oj-i report the same in I as in I' and all their occurrences 
in S are before stage j, Lemma 5 yields ioi £ = 1 , 2 ,... ,i — 1 . Also 

(ai) = {af) = % loT £ = i + l,i + 2 ,... ,ni, since no such applicant has been considered 
before stage j. But then, all applicants apart from a* are indifferent between and , 
therefore a* preferring to implies that is not a POM in D, a contradiction to 
Theorem 7. 

□ 


The next result then follows directly from Theorem 11 . 

Corollary 12. GSDT is truthful if all applicants have quota equal to one. 

There are priority orderings for which an applicant may benefit from misreporting her 
preferences, even if preferences are strict. This phenomenon has also been observed in a 
slightly different context [6]. Let us also provide an example. 

Example 13. Consider a setting with applicants ai and 02 and courses ci and C 2 , for 
which b{ai) = 2, 6(02) = 1 , q{ci) = 1, and q{c 2 ) = 1 . Let I be an instance in which 
C 2 '^aT_ci and 02 finds only ci acceptable. This setting admits two POMs, namely pLi = 
{(ai, C2), (02, Cl)} and H 2 = {(ai, ci), (ai, C2)}. GSDT returns pLi for Yi = (01,02,01). If 
oi misreports by stating that she prefers ci to C 2 , GSDT returns fX 2 instead of jj-i. Since 
GSDT is not truthful given S. 

The above observation seems to be a deficiency of GSDT. We conclude by showing 
that no mechanism capable of producing all POMs is immune to this shortcoming. 

Theorem 14. There is no universally truthful randomized mechanism that produces all 
POMs in CA, even if applicants’ preferences are strict and all courses have quota equal to 
one. 

Proof. The instance Ii in Figure 2 admits three POMs, namely fii = {(oi,ci), 

(02,02)}, 1x2 = {(oi,ci), (oi,C2)} and = {(oi, C2), (02, ci)}. Assume a randomized 
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mechanism (p that produces all these matchings. Therefore, there must be a deterministic 
realization of it, denoted as (p^, that returns given Ii. Let us examine the outcome of 
(p^ under the slightly different applicants’ preferences shown in Figure 2, bearing in mind 
that (p^ is truthful. 

• Under I 2 , (p^ must return ^2- The only other POM under I 2 is jJs, but if (p^ returns 
/U3, then 02 under Ji has an incentive to lie and declare only ci acceptable (as in 12)- 

• Under /a, cp^ must return ^ 2 - The only other POM under /a is but if (p^ returns 
;U3, then oi under has an incentive to lie and declare that she prefers ci to C2 (as 
in 12). 

/4 admits two POMs, namely ^2 and If cp^ returns iJ, 2 , then oi under Ji has an 
incentive to lie and declare that she prefers C2 to ci (as in I 4 ). If (p^ returns ^ 3 , then 
02 under has an incentive to lie and declare C2 acceptable—in addition to ci—and 
less preferred than ci (as in I 4 ). Thus overall (p^ cannot return a POM under 14 while 
maintaining truthfulness. □ 

6 Future work 

Our work raises several questions. A particularly important problem is to investigate the 
expected size of the matching produced by the randomized version of GSDT. It is also 
interesting to characterize priority orderings that imply truthfulness in GSDT. Conse¬ 
quently, it will be interesting to compute the expected size of the matching produced by 
a randomized GSDT in which the randomization is taken over the priority orderings that 
induce truthfulness. 
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